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Abstract 

A prominent feature of gene transcription regulatory networks is the presence in 
large numbers of motifs, i.e, patterns of interconnection, in the networks. One such 
motif is the feed forward loop (FFL) consisting of three genes X, Y and Z. The 
protein product of x of X controls the synthesis of protein product y of Y . Proteins 
x and y jointly regulate the synthesis of z proteins from the gene Z. The FFLs, 
depending on the nature of the regulating interactions, can be of eight different types 
which can again be classified into two categories: coherent and incoherent. In this 
paper, we study the noise characteristics of FFLs using the Langevin formalism and 
the Monte Carlo simulation technique based on the Gillespie algorithm. We calculate 
the variances around the mean protein levels in the steady states of the FFLs and 
find that, in the case of coherent FFLs, the most abundant FFL, namely, the Type-1 
coherent FFL, is the least noisy This is however not so in the case of incoherent 
FFLs. The results suggest possible relationships between noise, functionality and 
abundance. 

Keywords: feed forward loop, stochastic gene expression, noise, gene transcription 
regulatory network, Langevin formalism, Gillespie algorithm. 



1. Introduction 

Biological networks represent the complex webs of biomolecular interactions and reac- 
tions underlying cellular processes. Well-known examples of biological networks include 
metabolic reaction, protein-protein interaction and gene transcription regulatory networks 
(GTRNs) [HE]- The availability of large scale experimental data and powerful computa- 
tional tools provide information on the structural and functional features of the complex 
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Figure 1: Eight types of FFLs: (a) Type-1, (b) Type-2, (c) Type-3, (d) Type-4 coherent 
FFLs, (e) Type-1, (f) Type-2, (g) Type-3, (h)Type-4 incoherent FFLs. The arrow sign 
denotes activation and the _L sign repression. 



networks. In the case of a GTRN, the nodes of the network represent genes and two nodes 
are connected by a directed link if the protein product of one gene regulates the synthesis 
of proteins from the other gene. Existing databases on simple organisms like E. coli and 
S. cerevisiae show that the GTRNs of these organisms have common structural motifs like 
bi-fan, single input module (SIM) and feed forward loop (FFL) [SHHE]- Such motifs are 
more abundant in the naturally occurring networks than in their randomized counterparts, 
highlighting the essential roles of motifs in network function. 

The regulatory and other biochemical processes associated with a GTRN are proba- 
bilistic in nature giving rise to fluctuations in the levels of proteins synthesized by different 
genes. The magnitude of noise cannot be neglected when the number of biomolecules par- 
ticipating in the network processes is small. Recently, several theoretical 13 IH1 13 HH] 
as well as experimental [HI [TJl HH] studies have been carried out on the origins and con- 
sequences of stochasticity and the dependence of noise on some important parameters of 
gene expression (GE) like the transcription and translation rates. The effect of stochasticity 
may be both advantageous and disadvantageous. Stochasticity can give rise to phenotypic 
variations in an identical population of cells kept in the same environment. It thus plays a 
positive role in situations where phenotypic diversity is beneficial. In most cases, however, 
stochasticity acts to diminish fidelity in cellular processes. Noisy regulatory signals, for ex- 
ample, may not achieve the desired outcome introducing uncertainty in cellular behaviour. 

Fraser et al. [14] have recently addressed the important issue of the relation of noise 
to the fitness of an organism. They estimate the noise in protein production for almost 
all the genes in S. cerevisiae and show that the amount of noise associated with protein 
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levels in the steady state has lower magnitude in the cases of essential genes and genes 
encoding subunits of multi-protein complexes. Fluctuations in the protein levels of these 
functionally important classes of genes are particularly detrimental to organismal fitness 
because of reduced functionality. The lower amounts of noise associated with the genes 
support the hypothesis that noise is an evolvable trait acted on by natural selection. In this 
paper, we consider a simple stochastic model of GE to determine the noise characteristics 
of a particular type of motif appearing in GTRNs, namely, the FFL 0111 EH- A FFL is a 
three-node motif describing three genes X, Y and Z (figure 1). The protein x produced 
from gene X regulates protein synthesis from gene Y. Proteins x and y also jointly regulate 
the expression of gene Z. Inducer molecules S x and S y are in general required to activate or 
inhibit the function of protein molecules x and y. There are three transcriptional regulatory 
interactions in a FFL, each of which can have either positive (activation) or negative 
(repression) sign. The motif with three links can be in eight possible configurations which 
fall into two categories: coherent and incoherent (figure 1). In a coherent FFL, the sign 
of the direct regulation path from X to Z is the same as the overall sign of the indirect 
regulation path via Y. There are four such configurations. In the other four configurations, 
termed incoherent FFLs, the signs of the direct and indirect regulation paths are opposite. 
The two protein inputs x and y regulate the target gene Z through either an AND-gate or 
an OR-gate. In the first case, both x and y proteins are needed to regulate gene Z and in 
the second case, either x or y protein is sufficient for the regulation of Z. The functionality 
of the different types of FFLs has been determined using a simple mathematical analysis 
based on the deterministic rate equation approach |3j. The coherent FFL is found to 
serve as a sign-sensitive delay element. Consider the Type-1 coherent FFL with AND-gate 
regulation and a step-like pulse of x proteins as the input stimulus (signal). Expression 
of gene Z can only begin when the level of y proteins is sufficient to cross the activation 
threshold for Z. The response time is a measure of the speed of response and is given by 
the time taken for the z proteins to reach an amount which is half the steady state level. 
Sign-sensitive delay implies that the response time to step-like stimuli is asymmetric, i.e, 
the response time is delayed in one direction (pulse OFF to ON) and rapid in the other 
direction (ON to OFF). As a result, if the activation of the X gene is transient, the Z gene 
cannot be significantly activated, i.e, the input signal is not transduced through the FFL. 
The z proteins are synthesized only when the X gene is activated for a sufficiently long 
time interval. The Z gene switches off rapidly once the X gene is deactivated. In other 
words, the coherent FFL functions as a persistence detector, responding only to a persistent 
stimulus and filtering out fluctuations in the input signal. The role of the coherent FFL as 
sign-sensitive delay has been verified experimentally (THJ - The incoherent FFLs function 
as sign-sensitive accelerators speeding up the response time in one direction (OFF to ON 
in the stimulus step) but not in the other direction (ON to OFF). Some incoherent FFLs 
act also as pulse generators. Amongst the coherent FFLs, the Type-1 FFL appears the 
maximum number of times in the GTRNs of E. coli and S. cerevisiae. Similarly, in the 
case of incoherent FFLs, the Type-1 FFL is the most abundant. We calculate the noise 
characteristics of the coherent and incoherent FFLs using the Langevin formalism 
and the Monte Carlo simulation technique based on the Gillespie algorithm (GA) [T7tfT8]. 



3 



We show that the most abundant coherent FFL, namely, the Type-1 FFL, is the least 
noisy. This is, however, not true in the case of the incoherent FFLs. The lower number 
of FFLs has been ascribed to their reduced functionality (4j. Noise is disadvantageous if 
it affects operational reliability. Our results on noise characteristics of FFLs suggest that 
noisy motifs are likely to be selected against during evolution if noise is detrimental to the 
function of the motifs. 



2. Stochastic Model of GE 

The simple stochastic model of GE has been studied earlier as a Markovian model for the 
gene induction process [THj and also to explore the possible origins of the genetic disorder, 
haploinsufficiency [TQl [201 - In the minimal model, a gene can be in two possible states: 
inactive (G) and active (G*). Due to stochasticity, the gene makes random transitions 
between the inactive and active states with k a and kd being the activation and deactivation 
rate constants. In the active state, protein production occurs with the rate constant j3 p . 
Protein decay occurs with the rate constant 7 P . The protein decay rate has two components, 
one, the degradation rate and the other, the dilution rate of proteins due to cell growth 
and division. The reaction scheme RS-1 is shown in equation (|T}, 

ka ftp 

G ^ G* — > p — > $ (1) 
k d 

Let P(ni, ri2, t) be the probability that at time t, m genes are in the active state G* and 
the number of protein molecules is n 2 . The rate of change of the probability with respect 
to time is given by the Master Equation 

dP(nun 2 ,t) _ £.j( ntot - Ul + l)P(m - 1, n 2 , t) - (n tot - n x )P{n u n 2 , t)) 

+k d [(n 1 + l)P(ni + 1, n 2 , t) - n 1 P(n 1 , n 2 , t)) ^ 

+/3 p [n 1 P(n 1 , n 2 -l,t)- niP(n x , n 2 , t)} 
+7 P [(^2 + l)P(ni, n 2 + 1, t) - n 2 P(ni, n 2 , t)} 

where n to t is the total number of genes. 

For each rate constant, the gain term adds to the probability and the loss term subtracts 
from the same. The simplicity of the stochastic model enables one to calculate the mean 
protein level < n 2 > and its variance < 5n 2 >=< n 2 > — < n 2 > 2 in the steady state 
using the standard generating function approach. The results are: 

< n >— — ntot ^ a (3) 
7p k a + kd 

< Snj >=< n 2 >[l + , ^ kd -) (4) 

2 1 (k a + kd)(k a + k d + 7 P y yi 
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Also, the mean number of genes in the active state is given by 

< n >— ntot ^ a (5) 
k a + kd 

The minimal model (equation (JH)) describes constitutive GE. We now assume that the 
transition from the state G to the state G* is brought about by activating regulatory 
molecules S. The reaction scheme RS-2 in the presence of such molecules is given by 

h k a (3 P 7 P 

G + S ^ GS ^ G* — ► p — > $ (6) 
k 2 k d 

where G-S represents the bound complex of G and S from which transition to the active 
state G* occurs. The total number of genes n to t is given by 

ntot = 9 + 9s + 9* (7) 

where g, g s and g* are the number of genes in the states G, G_S and G* respectively. In 
the steady state, ^ = and = 0. From the first condition, one obtains 

Q s 

k =3 - (8) 

where Ki = |^ is the equilibrium dissociation constant and s is the number of regulatory 
molecules. From the second condition, the expression for g* in the steady state is given by 

, 8/K! 

„* _ ,L tot^a l+s/Kl ^ 



Expressions © and for the number of genes in the active state G* are equivalent on 
defining effective activation and deactivation rate constants 

^ = k * i irh k ' d = kd (10) 

1 + S/Aj 

The equivalence relations are useful as one can map the reaction scheme RS-2 onto the 
simpler scheme RS-1 while calculating mean protein levels and the associated variances. 
Regulatory molecules, in general, oligomerise to form an active complex S n where n is the 
number of regulatory molecules contained in the complex. In this case, the effective rate 
constants k a and k d are given by 

// _ h i s l K T k'-k, mi 

a ~ a l + ( s /K) n) K d~ K d UiJ 

where K n = K 1 K C , K c being the equilibrium dissociation constant for oligomerisation, 
i. e, 
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c 



nS ^ S n (12) 

When the regulatory molecules S act as repressors, the effective rate constants are given 
by 

ka = ka l + (s/K) n ' kd = kd ^ 

In this case, repressor molecules on binding to genes prevent their activation to the state 
G*. 

We now apply the stochastic model of GE to determine the mean levels of proteins x, 
y and z and the variances thereof in the steady state of a FFL. The variances calculated 
are a measure of the intrinsic noise associated with GE as fluctuations in the number of 
regulatory molecules are ignored. Let fa and ji (i = x, y, z) be the rate constants for 
the synthesis and decay respectively of protein i. For proteins x, the mean protein level 
x av and its variance < 5x 2 > in the steady state are obtained from equations (j3J) and (J3J 
[HUlin] as (with n tot = 1) 

x av =< x >= — (14) 
<6x 2 >=<x> [1 + - , . kd , -] (15) 

(k a + k d )(k a + k d + lx y 

where k a and kd are activation and deactivation rate constants of gene X. Protein molecules 
x regulate the activation of gene Y according to the reaction scheme RS-2. Mapping onto 
the simpler reaction scheme RS-1, one obtains in the steady state 

«- =< ^T,A, ,16) 

< tf >;<'> (17) 

The effective rate constants k a and k d have the forms given in equations (|TT| or (|T3|) 
depending on whether the regulatory interaction is activating or repressing in nature. In 
the case of activation, assuming n to be 2, 



k a — kay Z TTTt? \2i ~ kdy (18) 



(x/K x . " 2 
1 + (x/ K xyJ 

In (|THj) . k ay represents the limiting value of k' a obtained when S> 1. In the case of 
repression, 

k ' a = Ky \ + {x l /K xy r k ' d = kdy (19) 
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Both the x and y proteins regulate the activation of the Z gene. The mapping of the 
associated reaction scheme onto the simpler reaction scheme RS-1 is still possible. The 
effective rate constants k a and k d have specific forms depending on the nature of the 
regulating interaction (activating/ repressing) and the type of logic gate (AND/ OR) in 
operation. The mean protein level in the steady state and its variance are 

k" 

z av =<z>='^ ° (20) 
lz k a + k d 

K fc2 ~ :>ll+ k + k)kU + ^ ] (21) 

The activation and deactivation rate constants k a and k d are 

ka = k-az G(x,y,T xz ,T yz ), k d = k& z (22) 
where k az is the limiting value of k" a . For the AND-gate, 

G(x, y, T xz j Tyz) = T xz T yz (23) 

For the OR-gate, 

fit rp rp \ _ {l + {x/K xz f)T xz + {l + {y/K yz f)T yz 

{ ' v* 1 *" 1 **)- 1 + ( X /K xz y + ( y /K yz y [M} 

This expression has been derived assuming that the regulatory molecules x and y compete 
to bind at the operator region of the gene Z, as in Ref. [1]. 
For activating regulatory interactions, 

(x/K xz f {y/K yz f 

Ixz i + (x/K xz y> Iyz i + (y/K yz y 

For repressing regulatory interactions, 

1 1 

Txz = i + (x/K xz y ' Tyz = i + ( y /K yz y (26) 

The parameters K xy , K yz and K xz appearing in ifTTj) . (jZHj) and (|26|l are analogous to the 
parameter K in ((TTJ) . In the steady state of the FFL, all three proteins x, y, z are in their 
steady state levels and the effective rate constants k a , k a are calculated with the steady 
state values x = x av and y = y av . 

The FFL may be considered to be a two-step signaling cascade. The x and z proteins 
constitute respectively the input and output signals of the cascade. With stochasticity 
taken into account, it is desirable that cascades are able to transmit signals in a reliable 
manner. When fluctuations are considerable, there is a danger of the noise building up in 
successive steps of the cascade corrupting the final output signal. Thattai and Oudenaarden 
[TH] have studied the noise characteristics of signaling cascades and have shown that under 
certain conditions the fluctuations in the output signal are bounded. Also, noise reduction 
is possible, i.e, the output signal is less noisy than the input signal. 
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3. Noise Characteristics of FFL 



The variance around the mean protein level has two components: intrinsic and extrinsic. 
In the last section, the variance due to only the intrinsic part has been calculated. In this 
section, the fluctuations in the number of regulatory molecules, constituting extrinsic noise, 
are taken into account. The total variances in the steady state of the FFL are denoted 
as < 5x 2 >tot (equals < 5x 2 > given in (15)), < 5y 2 > to t and < 5z 2 >tot- The variances 
can be calculated using the method followed in jTH]. We use Langevin equations to take 
stochasticity into account. The equation describing the production of protein x is given by 

i = (3x Trrr-7* x + vi(t) (27) 

where x represents a time derivative. Stochasticity is associated with the time-dependent 
noise term rji(t) in equation (27). The random variable T}i(t) obeys white-noise statistics, 
i.e, 

< 771(f) >=0, <Vl (t)r h (t + r) >=qi5(r) (28) 

where 5(t) is the Dirac delta function and < ... > denotes an ensemble average. The state 
dependences of 771 (x,t) and q(x) are ignored since we are interested in the steady state 
noise characteristics. In the absence of the noise term in equation (|27[). the mean protein 
level in the steady state (x(t) = 0), as in equation (JTJJ, is recovered. We linearize equation 
(|27jl for fluctuations, assumed to be small, about the steady state to obtain 

5x(t)+7 x 5x = r h (t) (29) 
Fourier transform of equation (29) yields 

+ 7s) 5x(u) = 771(0;) (30) 
Next, taking ensemble average and applying condition (28), we get 

< \5x(u)\ 2 >= (31) 

The steady state variance < 5x 2 > to t is given by an inverse Fourier transform at r = 0, i.e, 

< Sx 2 > tot = (32) 

2 7rr 

Since < 5x 2 >tot=< 5x 2 > (equation (jTHl) ). q 1 is known explicitly from equation (I32J) . For 
protein y, the Langevin equation is given by 

y + i y y = f3 y fxy(x) + 772(0 (33) 

with 

<V2(t)7 l2 (t + r) >=q 2 5(r) (34) 
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In equation (|33|) , the rate of creation of y proteins in terms of the x proteins is given by 
the first term on the r.h.s. The function f xy (x) is designated as the transfer function and 
is given by 



k' 



x 



k„ + k r] 



(35) 



where k' a and k' d are as defined in equations (|T7fl) and (JTHJ) - Again, the mean protein level 
in the steady state, y av (equation JUJ) can be recovered from equation (f33|) by ignoring 
the noise term and putting y = 0. Going through the same steps as before, the variance 
< Sy 2 >tot is obtained as 



< 5y 2 > 



Q2 



Pi 4 qi 



tot- 



(36) 



27y 2lxly(lx+ly) 

The first term in equation (36) is the intrinsic noise term given by < by 2 > (equation 
ifTTjl ). The second term, describing extrinsic noise, arises due to the noise propagated from 
the input, i.e, due to the fluctuations in the number of x regulatory proteins. In the same 
equation, c x is the derivative of the transfer function f x 
steady state value of x, i.e 



y\X), w.r.t x, calculated at the 



9 fxy (•£ 

OX 

For the z proteins, the Langevin equation is 



x 



Xn 



z + lzZ = f3 z g xy (x, y) + r} 3 (t) 



(37) 



(3f 



with 



< rj 3 (t) rj 3 (t + t) >= q 3 S(r) 
The transfer function g xy (x,y) is given by 



9x V (x,y) 



k„ + k. 



(39) 



(40) 



where k a and k' d have been defined in equations (|22|) - (|26|) . The variance < Sz 2 > tot is given 
by 



< Sz 2 > = 93 I 4 , 91 01 4 , 91 0j 4 4 (7*+7»/+7*) 

tot 2 7z 2 ly 7z ( 7j/ +7z ) 2 7s 7 Z (7a +7 Z ) 2 7^ 7 a 7 Z (7^ +-y y ) (-y y +7 Z ) (j x +7z ) 

(41) 

I Ql I3 y 0j -y y c x d x d y (■y x +f y +-y z ) 
7* 7y 7z (7:r+7!/)(7!/+7z)(7^+7z) 

where 
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dg xy (x,y) 
dx 



x 



X n 



V = Va 



d i , 



dg xy (x,y) 
dy 



x 



X n 



y = y a 



(42) 



In equation (|39|) . the first term j^- is the intrinsic noise term given by < 5z 2 > (equation 
(J2IJ)). The other terms represent noise propagated from the earlier stages, i.e, occur due 
to fluctuations in the number of x and y regulatory molecules. These terms describe the 
extrinsic noise. 



4. Results and Discussion 

We now calculate the variances < 5x 2 >tot, < by 2 >tot and < 5z 2 > tot for the different 
FFLs. Our goal is to compare the variances for the same as well as different FFLs. For 
simplicity we assume that all 7i's (i =x, y, z)=l and K xy = K yz = K xz = 1. The mean 
levels of proteins x, y and z in the steady state are kept the same in all the cases so that a 
meaningful comparison between the variances can be made. Figures 2 and 3 show the plots 
of < 5x 2 > tot (line with long dashes), < 5y 2 > to t (line with short dashes) and < 5z 2 > tot 
(solid line) versus (3 y for the coherent and incoherent FFLs respectively. The regulation of 
the Z gene by the x and y proteins is achieved via the AND gate. The plots have been 
obtained keeping the mean protein levels x av , y av and z av fixed at m — 5.0. For this we put 

(3 X = I3 Z = 10, k a = k d = 20, k" a = k az = 20, k' a = -P^-, k' d = 20, (43) 

p y — m 

For the coherent Type-1 FFL with AND-gate regulation, the values of k ay and k' a are fixed 
from the relations 

2 

t = **T^? <44) 



and 



4 

m 



k a — k az (45) 
(1 + m z ) z 

Equivalent relations hold true for the other types of FFLs. An examination of figure 2 
shows that the Type-1 coherent FFL is the least noisy amongst all the coherent FFLs. 
The number of times the Type-1, Type- 2, Type-3 and Type-4 coherent FFLs appear in 
the GTRNs of E. coli (S. cerevisiae) are 28, 2, 4, 1 (26, 5, 0, 0) 0. The most abundant 
coherent FFL, namely the Type-1 FFL, is the least noisy. This is not true for the incoherent 
FFLs. The number of times the Type-1, Type-2, Type-3, Type-4 incoherent FFLs appear 
in the GTRNs of E. coli (S. cerevisiae) are 5, 0, 1, 1 (21, 3, 1, 0). The most abundant 
incoherent FFL, namely, the Type-1 FFL, is more noisy than, say, the Type-4 incoherent 
FFL, which is practically absent in the GTRNs. 

The reasons as to why some FFLs occur more often than the others in GTRNs, are not 
well understood. Generally speaking, reduced functionality of a motif may be a possible 
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Figure 2: Variances < 5x 2 > to t (line with long dashes), < 5y 2 > to t (line with short dashes), 
< 5z 2 >tot (solid line) versus j3 y for (a) Type-1, (b) Type-2, (c) Type-3 and (d) Type-4 
coherent FFLs controlled by AND-gate. The mean protein level is fixed at m=5. The 
other parameter values are mentioned in the text 

reason for its lower abundance, i.e, being selected against during evolution. As suggested 
by Mangan and Alon |3], for AND-gate FFLs, Types-3 and 4 have reduced functionality 
compared to Types- 1 and 2, as the former respond to at most one input stimulus (S x ) 
whereas the latter respond to both the input stimuli S x and S y . Also, Type-1 coherent 
FFL gains advantage from increased cooperativity leading to a sharper response in the 
presence of stimuli. For low x concentrations, the effective Hill coefficient (a measure of 
cooperativity) is 6 (for n = 2 in equation (12)) whereas the same, for the other FFLs, 
is 2. We now discuss the relationship between noise, function and abundance. For the 
sake of clarity, we focus attention on the Type-1 and Type-4 coherent FFLs. Figure 4 
shows plots for the total variances around the mean protein level m = 5 when the input 
noise < 5x 2 > to t is higher than that in the cases of figures 2 and 3. The parameter values 
changed from equation (43) are k a = kd = 5, k d = 30 and k az = 30. In the case of the 
Type-1 coherent FFL, one finds the existence of a parameter region in which the variance 
decreases in the successive stages of the FFL so that the output noise is less than the input 
noise. Such a parameter region is absent in the case of the Type-4 coherent FFL. Another 
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notable feature of the plots in figures 2 and 4 is that < by 2 > to t and < 5z 2 > tot in the 
case of the Type-1 FFL have almost linear dependences on (3 y whereas the same quantities 
are more nonlinear in the case of the Type-4 FFL. For the Type-1 FFL, the dominant 
contribution to < 5z 2 > tot is from the internal noise associated with the expression of the 
Z gene. Fluctuations in the x and y protein levels have little effect on the total noise. In 
the case of the Type-4 FFL, the extrinsic contribution to noise is greater than that in the 
case of the Type-1 FFL. In short, figures 2 and 4 show that the Type-1 FFL acts as a 
better filter of noise. As mentioned in the Introduction, one possible function of coherent 
FFLs is as a persistent detector or equivalently as a filter which attenuates the input noise. 
The Type-1 coherent FFL being less noisy than the Type-4 coherent FFL, functions better 
as a noise filter. The reduced functionality of the Type-4 coherent FFL explains its lower 
abundance from an evolutionary point of view. Similar reasoning holds true for Type-2 
and Type-3 coherent FFLs. Thus for coherent FFLs, noise is disadvantageous as it erodes 
the function of a FFL as a persistent detector. For incoherent FFLs, functioning as sign- 
sensitive accelerators, noise appears to have no direct relationship with abundance, i.e, 
noise is not detrimental to the functioning of the FFLs. 

Our analysis of the noise characteristics of FFLs is based on the Langevin formalism 
which is approximate in nature. To establish the validity of the results, we have calculated 
the variances using Monte Carlo simulation based on the GA [TH| - The GA provides a 
numerical solution of the Master Equation leading to an accurate description of the time 
course of evolution of a stochastic system. A brief description of the GA is as follows. Con- 
sider N chemical species participating in M chemical reactions. Let X(i), i = 1,2, 3, , N 

denotes the number of molecules of the ith chemical species. Given the values of X(i), 
i = 1,2,3, ...N at time t, the GA is designed to answer two questions: (1) when will the 
next reaction occur? and (2) what type of reaction will it be? Let the next reaction occur 
at time t + t. Knowing the type of reaction, one can adjust the numbers of participating 
molecules in accordance with the specific reaction scheme. Thus, with repeated applica- 
tions of the GA, one can keep track of how the numbers, X(i)'s, change as a function of 
time due to the occurrence of M different types of chemical reactions. Each reaction \x 
(/i = 1, 2, 3, M) has a stochastic rate constants associated with it. The rate constant 
has the interpretation that C^dt is the probability that a particular combination of reacting 
molecules participates in the nth reaction in the infinitesimal time interval (t,t + dt). If 
is the number of distinct molecular combinations for the //t/i reaction, then a^dt = h^C^dt 
is the probability that the \ith reaction occurs in the infinitesimal time interval (t, t+dt). 
The implementation of the GA algorithm is described in detail in Refs. [EJ HE]- We use 
the algorithm to determine the evolution of the number of z proteins of a FFL as a function 
of time. Figures 5(a) and 6(a) show the results for the coherent Type-1 and Type-4 FFL 
respectively. The solid line, in each case, represents the mean trajectory obtained from 
a solution of the deterministic equations. The reactions considered are those associated 
with a FFL. Expression of each gene X , Y and Z is according to the reaction scheme 
RS-2 (equation (6)). For the X gene, there is no regulatory molecule S. The x proteins 
dimerize (equation (12) with n = 2) and the dimers regulate expression of the Y gene. 
The y proteins also dimerize to regulate expression of the gene Z . Considering AND-gate 
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Figure 3: Variances < 5x 2 > tot (line with long dashes), < 5y 2 > tot (line with short dashes), 
< 5z 2 > tot (solid line) versus (3 y for (a) Type-1, (b) Type-2, (c) Type-3 and (d) Type-4 
incoherent FFLs controlled by AND-gate. The mean protein level is fixed at m=5. The 
other parameter values are mentioned in the text. 

regulation of the Z gene expression, both the x and y protein dimers bind simultaneously 
at the operator region for activation of the gene. Other possibilities like the operator region 
unoccupied or occupied by a single dimer are considered but the gene remains in the inac- 
tive state in these cases. The stochastic rate constants C M 's are equal to the rate constants 
k^s since in the deterministic approach the numbers and not the concentrations of the 
different molecules are considered. Figure 5(b) and 6(b) show the histograms describing 
the distribution of protein levels, N(z) versus z, for the coherent Type-1 and Type-4 FFLs 
respectively. The histograms have been obtained by accumulating data over 5000 trial 
runs. The distribution is broader in the case of the Type-4 coherent FFL indicating that 
it is more noisy than the Type-1 FFL. The variances for Type-1 and Type-4 distributions 
are 110.612 and 329.990 respectively. The simulation results support the results obtained 
by using the Langevin formalism that the Type-4 coherent FFL is more noisy than the 
Type-1 coherent FFL. 
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Figure 4: Variances < 5x 2 > to t (line with long dashes), < 5y 2 > to t (line with short dashes), 
< 5z 2 >tot (solid line) versus j3 y for (a) Type-1 coherent FFL and (b) Type-4 coherent 
FFL controlled by AND-gate. The mean protein level is fixed at m=5. The input noise is 
greater than that in the case of figure 2. 

5. Conclusion and Outlook 

In this paper, we have studied the noise characteristics of coherent and incoherent FFLs 
using the Langevin formalism as well as a numerical simulation technique based on the 
Gillespie algorithm. Noise is undesirable if it affects operational reliability. Coherent 
FFLs function as noise filters and the performance of the Type-1 FFL is found to be the 
best since the propagation of noise associated with the input signal is the least in this 
case. The coherent Type-1 FFL is the most abundant of FFL motifs appearing in the 
GTRNs of simple organisms. The functional superiority of the Type-1 FFL, amongst the 
four coherent FFLs, is the main reason why the particular motif is favoured by natural 
selection. Mangan and Alon |3] have speculated that increased effective cooperativity 
of the Type-1 FFL might be responsible for its evolutionary advantage. Thattai and 
van Oudenaarden (THj have shown that increased cooperativity leads to noise reduction. 
This possibly explains why the Type-1 coherent FFL has less output noise than the other 
coherent FFLs. For the incoherent FFLs, no clear conclusion regarding the role of noise 
can be arrived at as concrete results are lacking. Noise may be advantageous to function 
in certain cases. Stochastic resonance is a phenomena in which noise in threshold systems 
facilitates detection of subthreshold signals f2l|. In stochastic focusing, fluctuations (noise) 
sharpen the response to an input signal, i.e, make a graded response mechanism work more 
like a threshold one [22j. Further studies are needed to ascertain whether noise aids the 
function of incoherent FFLs in some manner similar to stochastic focusing. If this is true, 
then the most abundant motif need not be the least noisy. Regulatory cascades of which 
the FFL is a special case can exhibit interesting kinetic phenomena which include even 
transient ones like pulse generation OEl]. It will be of considerable interest to determine 
the effect of noise on such phenomena. 

Fraser et al. [H] have addressed the question of whether noise associated with GE 
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Time (t) z 

(a) (b) 

Figure 5: (a) The number of proteins z(t) as a function of time t for the Type-1 coherent 
FFL. The time trajectory is obtained using the GA. The solid line determines the mean 
curve, (b) Histogram describing the distribution of protein levels (N(z) versus z) in the 
steady state. 




Time (t) z 

(a) (b) 



Figure 6: (a) The number of proteins z(t) as a function of time t for the Type-4 coherent 
FFL. The time trajectory is obtained using the GA. The solid line determines the mean 
curve, (b) Histogram describing the distribution of protein levels (N(z) versus z) in the 
steady state. 



15 



has any significant effect on the fitness of an organism. They have estimated the noise in 
protein production of almost all the S. cerevisiae genes using an experimentally verified 
model of stochastic GE. Their major finding is that noise is minimized in the cases of genes 
for which it is likely to be most harmful. These genes include essential genes, i.e, genes 
whose deletion is lethal to the organism and genes which synthesize the subunits of multi- 
protein complexes. Both types of genes are expected to be sensitive to noise. For essential 
genes, fluctuations in protein levels may have considerable effect on functional viability 
if the levels fall below the threshold required for normal cellular activity. Similarly, in 
the case of a multy-protein complex, fluctuations in the amounts of protein subunits may 
hinder the appropriate assembly of the entire complex. The observations of Fraser et al. 
are in agreement with our results on coherent FFLs. Since noise has a deleterious effect 
on the function of a coherent FFL as a persistence detector, it is minimized in the case of 
the best performing Type-1 FFL. 
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